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Abstract 

The best known inner bound for the 2-user discrete memoryless interference channel is the Han-Kobayashi rate 
region. The coding schemes that achieve this region are based on rate-splitting and superposition coding. In this 
paper, we develop a multicoding scheme to achieve the same rate region. A key advantage of the multicoding nature 
of the proposed coding scheme is that it can be naturally extended to more general settings, such as when encoders 
have state information or can overhear each other. In particular, we extend our coding scheme to characterize the 
capacity region of the state-dependent deterministic Z-interference channel when noncausal state information is 
available at the interfering transmitter. We specialize our results to the case of the linear deterministic model with 
on/off interference which models a wireless system where a cognitive transmitter is noncausally aware of the times 
it interferes with a primary transmission. For this special case, we provide an explicit expression for the capacity 
region and discuss some interesting properties of the optimal strategy. We also extend our multicoding scheme to 
find the capacity region of the deterministic Z-interference channel when the signal of the interfering transmitter 
can be overheard at the other transmitter (a.k.a. unidirectional partial cribbing). 

Index Terms 

Interference channel. Multicoding, Z-interference channel. Partial Cribbing, State Information 


I. Introduction 

The discrete memoryless interference channel (DM-IC) is the canonical model for studying the effect 
of interference in wireless systems. The capacity of this channel is only known in some special cases e.g. 
class of deterministic ICs |3j, 0, strong interference conditions @ _ 0> degraded conditions |8j, [|9| and 
a class of semideterministic ICs [10||. Characterizing the capacity region in the general case has been one 
of the long standing open problems in information theory. The best known achievable rate region is the 
so-called Han-Kobayashi scheme, which can be achieved by using schemes that are based on the concepts 
of rate-splitting and superposition coding [IT], JT2| . Rate-splitting refers to the technique of splitting the 
message at a transmitter into a common and a private part, where the common part is decoded at all the 
receivers and the private part is decoded only at the intended receiver. The two parts of the message are 
then combined into a single signal using superposition coding, first introduced in [T3j in the context of 
the broadcast channel. In all the special cases where the capacity is known, the Han-Kobayashi region 
equals the capacity region. However, it has been very recently shown that this inner bound is not tight in 
general 0. 

The first result we present in this paper is to show that the Han-Kobayashi region can be achieved by 
a multicoding scheme. This scheme does not involve any explicit rate-splitting. Instead, the codebook at 
each encoder is generated as a multicodebook, i.e. there are multiple codewords corresponding to each 
message. The auxiliary random variable in this scheme does not explicitly carry a part of the message, 
rather it implicitly carries some part of the message, and it is not required to specify which partQ In this 
sense, it’s role is different from that in the Han-Kobayashi scheme |TTJ, |12j, and is reminiscent of the 
encoding for state-dependent channels in [|T6|, and the alternative proof of Marion’s achievable rate region 
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for the broadcast channel given in [17]. A key advantage of the multicoding nature of the new scheme 
is that it can be easily extended to obtain simple achievability schemes for setups in which the canonical 
interference channel model is augmented to incorporate additional node capabilities such as cognition and 
state-dependence, while extending the original Han-Kobayashi scheme to such setups can quickly become 
highly involved. We demonstrate this by constructing schemes for settings which augment the canonical 
interference channel model in different ways. 

The first setting we consider is when the interference channel is state-dependent and the state-information 
is available non-causally to one of the transmitters (cognitive transmitter). For simplicity, we focus on 
the case when the cross-link between the non-cognitive transmitter and its undesired receiver is weak 
enough to be ignored, giving rise to the so called Z-interference channel topology. We know that for 
a point-to-point state-dependent channel with non-causal state information at the encoder, the optimal 
achievability scheme due to Gelfand and Pinsker uses multicoding at the encoders. Hence, for state- 
dependent interference channels with noncausal state information at the encoders too, we would like 
to use the idea of multicoding. Since the new achievability scheme that we present for the canonical 
interference channel already involves multicoding, it requires almost no change to be applicable to the 
state-dependent setting. Apart from being simple, we are also able to prove its optimality for the case of 
the deterministic Z-intcrfcrcncc channel. 

We then specialize our capacity characterization for the state-dependent deterministic Z-interference 
channel to the case where the channels are governed by the linear deterministic model of [T8J. In the recent 
literature, this model has proven extremely useful for approximating the capacity of wireless networks and 
developing insights for the design of optimal communication strategies. We consider a linear deterministic 
Z-interference channel, in which the state of the channel denotes whether the interference link is present 
or not. When the transmitters are base-stations and the receivers are end-users, this can model the scenario 
where one of the transmitters is cognitive, for example it can be a central controller that knows when the 
other Tx-Rx pair will be scheduled to communicate on the same frequency band. When the two Tx-Rx pairs 
are scheduled to communicate on the same frequency band, this gives an interference channel; when they 
communicate on different frequency bands each pair gets a clean channel free of interference. Moreover, 
the cognitive transmitter can know the schedule ahead of time, i.e. the times at which its transmission will 
be interfering with the second Tx-Rx pair. For this special case, we identify auxiliary random variables 
and provide an explicit expression for the capacity region. This explicit capacity characterization allows 
us to identify interesting properties of the optimal strategy. In particular, with single bit level for the 
linear deterministic channels (which would imply low to moderate SNR for the corresponding Gaussian 
channels), the sum rate is maximized when the interfering transmitter remains silent (transmits 0’s) at times 
when it interferes with the second transmission. It then treats these symbols as stuck to 0 and performs 
Gelfand-Pinsker coding. The second transmitter observes a clean channel at all times and communicates 
at the maximal rate of 1 bit per channel use. This capacity characterization also reveals that when all 
nodes are provided with the state information the sum-capacity cannot be further improved. Thus, for 
this channel, the sum-capacity when all nodes have state information is the same as that when only the 
interfering encoder has state information. 

Motivated by wireless applications, there has been significant recent interest in state-dependent inter¬ 
ference channels (ICs), where the state information is known only to some of the transmitters. Given the 
inherent difficulty of the problem, many special cases have been considered [ 191—[22), for which different 
coding schemes have been proposed. However, exact capacity characterizations have proven difficult. 
Another line of related work has been the study of cognitive state-dependent ICs p3|-[26|. Here, the 
term “cognitive” is usually used to mean that the cognitive transmitters know not only the state of the 
channel but also messages of other transmitters. Note that this assumption is significantly stronger than 
assuming state information at the transmitter as we do here. 

The second setting we consider is when one of the transmitters has the capability to overhear the signal 
transmitted by the other transmitter, which can be used to induce cooperation between the two transmitters. 
This is different from having orthogonal communication finks (or conferencing) between the encoders, 
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as studied in [27]. Instead, overhearing exploits the natural broadcasting nature of the wireless medium 
to establish cooperation without requiring any dedicated resources. A variety of different models have 
been used to capture overhearing j28j-[30|, and are known by different names such as cribbing, source 
cooperation, generalized feedback, cognition etc. We use ’’partial cribbing” to model the overhearing, in 
which some deterministic function of the signal transmitted by the non-cognitive transmitter is available at 
the cognitive transmitter in a strictly causal fashion. Again, for simplicity, we focus on the case of the Z- 
interference channel, where the cross-link between the non-cognitive transmitter and its undesired receiver 
is weak enough to be ignored. For this setting, we develop a simple achievability scheme by combining 
our multicoding-based scheme with block-Markov coding and show that it is optimal for deterministic 
configurations. 

Finally, to further illustrate the point that simple schemes can be obtained for augmented scenarios, we 
describe two extensions which introduce even more complexity in the model. In the first extension, a third 
message is introduced in the state-dependent Z-interference channel, which is to be communicated from 
the interfering transmitter to the interfered receiver. The second extension combines the state-dependent 
Z-IC and the Z-IC with unidirectional partial cribbing. In both extensions, we are able to obtain simple 
optimal schemes by naturally extending the multicoding-based achievability schemes. 


Organization 

We describe the models considered in this paper formally in Section [n| The alternate achievability 


scheme that achieves the Han-Kobayashi region is presented in Sections hnl Section IV describes the 
results concerning the state-dependent setup and section [V] describes the results concerning the cribbing 
setup. The two extensions are described in Section [VI] and we end the paper with a short discussion in 
Section IVII1 


II. Model 

Capital letters, small letters and capital calligraphic letters denote random variables, realizations and 
alphabets respectively. The tuple (x(l), x(2 ),..., x(n)) and the set {a, a + 1,..., b} are denoted by x n 
and [a : b] respectively, and Te stands for the e-strongly typical set of length-?;, sequences. 

We now describe the channel models considered in this paper. 

A. Canonical Interference Channel 

The two-user discrete memoryless interference channel py 1 ,y 2 |Xi,x 2 (?/i) x 2 ) is depicted in Fig. |T[ 

Each sender j e {1,2} wishes to communicate a message Mj to the corresponding receiver. 

A (n, 2 nRl , 2 nR2 , e) code for the above channel consists of the encoding and decoding functions: 

fj,i : [1 : 2 nRj ] —>• Xj, je{l,2},l < i < n, 

9j : y; -> [1 : 2 "'C j e {1,2}, 

such that 

Pr {gO'j 1 ) 7^ Mj} < e, j G {1,2}, 

where M x and M 2 are assumed to be distributed uniformly in [1 : 2 nRl ] and [1 : 2 nR ' 2 ] respectively. A 
rate pair (i? 1 ,i? 2 ) is said to be achievable if for every e > 0 , there exists a (n, 2 nRl , 2 ni?2 , e) code for 
sufficiently large n. The capacity region is defined to be the closure of the achievable rate region. 
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Fig. 1: Two-User Discrete Memoryless Interference Channel (DM-IC) 


B. State-Dependent Z-Interference Channel 

The discrete memoryless Z-interference channel p(yi\x 1: s)p(y 2 \xi,x 2 , s) with discrete memoryless state 
p(s) is depicted in Fig. [2j The states are assumed to be known noncausally at encoder 1. Each sender 
j e {1,2} wishes to communicate a message Mj at rate R 3 to the corresponding receiver. For this setting, 
a (n, 2 nRl , 2 ni?2 , e) code consists of the encoding and decoding functions: 

fi,i : [1 : 2 nRl } 1 < ^ < n, 

h,i : [1 : 2 ni?2 ] X 2 , 1 < i < n, 

gj : y'j ~X [1 : 2 nRj ], JE{ 1,2}, 

such that 

Pr{g(Yp)^Mj}<e, je{ 1,2}. 

The probability of error, achievable rate pairs (Ri,R 2 ) and the capacity region are defined in a similar 
manner as before. 


Mi 


M 2 



Mi 


M 2 


Fig. 2: The State-Dependent Z-Interference Channel (S-D Z-IC) 


The deterministic S-D Z-IC is depicted in Fig. [3] The channel output Y\ is a deterministic function 
yi(Xi, S ) of the channel input A"! and the state S. At receiver 2, the channel output Y 2 is a deterministic 
function y 2 (X 2 , T x ) of the channel input X 2 and the interference T ,, which is assumed to be a deterministic 
function t\(X 1 , S ). We also assume that if x 2 is given, y 2 (x 2 ,ti) is an injective function of f 1; i.e. there 
exists some function g such that t x = g(y 2 ,x 2 ). 

We consider a special case of the injective deterministic S-D Z-IC in detail, which is the modulo- 
additive S-D Z-IC, depicted in Fig. |4} All channel inputs and outputs come from a finite alphabet X = 
{0,1,..., | X\ — 1}. The channel has two states. In state S — 0, there is no interference while in state 
S — 1, the cross-link is present. When the cross-link is present, the output at receiver 2 is the modulo-A’ 
sum of X 2 and X x . For all other cases, the output is equal to the input. We can describe this formally as: 

Vj = X,, 

r 2 = x 2 ® (S ■ ATi). 

Assume that the state S is i.i.d. Ber(A). A generalization of this model that incorporates multiple levels 
is also considered subsequently. 
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Fig. 3: The Injective Deterministic S-D Z-IC 



Fig. 4: The Modulo-Additive S-D Z-IC. All channel inputs and outputs take values in the same finite alphabet X. The state S is Ber(A). 


C. Z-Interference Channel with Partial Cribbing 

The discrete memory less deterministic Z-interference channel is depicted in Fig. [5j The channel output 
Y\ is a deterministic function y\ (Xj) of the channel input X x . At receiver 2, the channel output Y 2 is a 
deterministic function y 2 (X 2 , 7j ) of the channel input A" 2 and the interference Tj, which is assumed to be 
a deterministic function ti(Xi). We also assume that if x 2 is given, y 2 (x 2 , ti ) is an injective function of t\, 
i.e. there exists some function g such that t\ = g(y 2 ,x 2 ). Each sender j G {1,2} wishes to communicate 
a message Mj at rate Rj to the corresponding receiver. 

We assume that encoder 1 can overhear the signal from transmitter 2 strictly causally, which is modeled 
as partial cribbing with a delay [311. The partial cribbing signal, which is a function of X 2 is denoted by 
Z 2 . So is a function of (Mi, Zlf 1 ) and X 2i is a function of M 2 . 
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Fig. 5: Injective Deterministic Z-Interference Channel with Unidirectional Partial Cribbing 
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A (n,2 nRl ,2 nR2 ,e) code for this setting consists of 

/m : [1 : 2 nRl ] x zr 1 ->X 1} 1 < i < n, 

f r 2 ,i : [1 : 2 nR2 } ^* 2 , 1 <i<n, 

9j : yj [1 : 2 nRj ], j G {1,2}, 

such that 

Pr {g(Y?) ? M,} < e, j€{ 1,2}. 

The probability of error, achievable rate pairs (Ri,R 2 ) and the capacity region are defined in a similar 
manner as before. 


A. Preliminaries 


III. Canonical Interference Channel 


The currently best known achievable rate region for the 2-user DM-IC was provided by Han and 
Kobayashi in [11|, using a scheme based on rate-splitting and superposition coding. An alternative 
achievable rate region that included the Han-Kobayashi rate region was proposed in [ JT2| , using another 
scheme that used rate-splitting and superposition coding. Using the terminology introduced in p2j, the 
encoding in |TT} can be described as employing homogeneous superposition coding, while that in [ |12[ | can 
be described as employing heterogeneous superposition coding. It was then proved in [33j that the two 
regions are, in fact, equivalent and given by the following compact representation (see also [3_4j, [35]). 

Theorem 1 (Han-Kobayashi Region). A rate pair (R 1 ,R 2 ) is achievable for the DM-IC p(lj\, y 2 \x^ x 2 ) if 



Ri 

< 

I(Xil 

\Yi\U 2 ,Q), 





R-l 

< 

I{X 2 \ 





Ri 

+ R-2 

< 

I(X n 

.Y^lhiU^Q) 

+ I{X 2 ,U 1 ; 

Y 2 \Q), 


Ri 

+ R.2 

< 

I{Xl: 

< U 2 ; YfUi, Q) 

+ I{x 2 , Up, 

Y 2 \u 2 ,Q), 


Ri 

+ R-2 

< 

l(Xl; 

V.-Y^Q) + I{X 2 -Y 2 \U u 

u 2 ,q), 


2R 1 

+ R-2 

< 

I{x i; 

.YfU^U^Q) 

+ I(X 2 , Up 

Y 2 \U 2 ,Q) + I{X 1 ,U 2 ; 

Yi\Q), 

Ri + 2R 2 

< 

I{X 2 \ 

\Y 2 \u x ,u 2 ,Q) 

+ I(X U U 2 , 

Y 1 \U l ,Q) + I{X 2 ,U 1 ; 

Y 2 \Q), 


for some pmf p(q)p(ui,xi\q)p(u 2 ,x 2 \q), where \U\\ < |Aj| +4, \U 2 \ < \X 2 \ +4 and \Q\ < 4. 


B. Outline of the new achievability scheme 

We first describe the alternative achievability scheme informally and discuss the similarities and dif¬ 
ferences with the existing achievability schemes. The later subsections describe and analyze the scheme 
formally. 

Encoder j, where j 6 {1,2} prepares two codebooks: 

• A transmission multicodcbook^J which is a set of codewords {x”(-, ■)} formed using the transmission 
random variable Xj. This set is partitioned into a number of bins (or subcodebooks), where the bin- 
index corresponds to the message, 

. A coordination codebook which is a set of codewords {«"(•)} formed using the auxiliary random 
variable Uj. 

Given a message, one codeword x” from the corresponding bin in the transmission multicodebook is 
chosen so that it is jointly typical with some sequence vf in the coordination codebook. The codeword 
x™ so chosen forms the transmission sequence. 

2 The term “multicodebook” refers to the fact that there are multiple codewords corresponding to each message. 
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At a decoder, the desired message is decoded by using joint typicality decoding, which uses the coordi¬ 
nation codebook and the transmission multicodebook of the corresponding encoder and the coordination 
codebook of the other encoder. Thus, a receiver makes use of the interference via its knowledge of the 
coordination codebook at the interfering transmitter. 

From the above description, it can be seen that the coordination codebook does not carry any message. 
Its purpose is to ensure that the transmission sequence from a given bin is well-chosen, i.e. it is beneficial 
to the intended receiver and also the unintended receiver. To the best of our knowledge, this is the first 
time an auxiliary random variable (which is not the time-sharing random variable) appears in one of the 
best known achievability schemes without being explicitly associated with any message. 

C. Achievability scheme 

Choose a pmf p(«i, xi)p(w 2 , t 2 ) and 0 < e' < e. 

Codebook Generation: 

• Encoder 1 generates a coordination codebook consisting of 2 nRlc codeword^] Ui(li c ), l\ c <G [1 : 2 nRlc ] 

i.i.d. according to J]J =1 P( u ii)- It a I so generates a transmission multicodebook consisting of 2 n ^ Rl+Rl p> 
codewords h p ), rn { 6 [1 : 2 nRl ], l lp e [1 : 2 nRl p] i.i.d. according to Yl™ =1 p(xu). 

• Similarly, encoder 2 generates a coordination codebook consisting of 2 nR2c codewords u 2 (l 2c ), he G 
[1 : 2 nR2c ] i.i.d. according to Y\^ = iP{u 2 i)- It also generates a transmission multicodebook consisting of 
2 n(R 2 +R 2 P ) codewords x 2 (rn 2 ,h P ), tn 2 G [1 : 2 nR 2 ], l 2p G [1 : 2 nIi2p ] i.i.d. according to YYi=iP{ x ^i)- 

Encoding: 

• To transmit message mi, encoder 1 finds a pair (Zi c ,/i p ) such that 

Kih^x^m^hp)) eT^ n) 

and transmits If it cannot find such a pair, it transmits x^^mi, 1). 

. Similarly, to transmit message m 2 , encoder 2 finds a pair (Z 2c , h P ) such that 

(u^hc):X n 2 (m2,l2 P ))eTj n) 

and transmits x 2 (m 2 , l 2p )- If it cannot find such a pair, it transmits x 2 (m 2 ,1). 

The codebook generation and encoding process are illustrated in Fig. [6} 

Decoding: 

• Decoder 1 finds the unique mi such that 

for some (li c ,hp,hc)- If none or more than one such rh\ are found, then decoder 1 declares error. 

. Decoder 2 finds the unique m 2 such that 

K(Z 2c ),x^(m 2 ,Z 2p ),<(Zi c ),^) e r} n) 

for some (Z 2c , l 2p , he)- If none or more than one such m 2 are found, then decoder 2 declares error. 


’Though there is no notion of a common message or a private message in this achievability scheme, we use the subscripts c and p to 
convey if the corresponding random variables are used for decoding at all destinations or only the desired destination respectively. 



Fig. 6: Codebook Generation and Encoding at Encoder 1. The independently generated X\ sequences, lined up vertically in the figure, are 
binned into 2 nRl bins. The independently generated coordination sequences u " are lined up horizontally. To transmit message mi, a jointly 
typical pair (xi,Ui) is sought where x ™ falls into the mi-th bin, and then x" is transmitted. 


Discussion: Before providing the formal analysis of the probability of error to show that the coding 
scheme described above achieves the Han-Kobayashi region, we discuss the connection between the new 
scheme and the scheme from [ 121 which motivates the equivalence of their rate regions. 

Consider the set of codewords used at encoder 1. While this set resembles a multicodebook, it can be 
reduced to a standard codebook (one codeword per message) by stripping away the codewords in each bin 
that are not jointly typical with any of the u’/ sequences, and therefore are never used by the transmitters. 
In other words, after we generate the multicodebook in Fig. [6j we can form a smaller codebook by only 
keeping one codeword per message which is jointly typical with one of the u" sequences (i.e., those 
codewords highlighted in Fig. [6]). Note that this reduced codebook indeed has a superposition structure. 
Each of the 2 nRl remaining codewords x™ is jointly typical with one of the 2 nRlc //j' codewords, and 
when n is large there will be exactly 2 n ( Rl ~ Rlc ' > a;j' sequences that are typical with each «" sequence, 
i.e., these 2 n(,! ' llu -' ) x ” sequences will look as if they were generated i.i.d. from p(x\ |«i ). Therefore, 
the u\ sequences can be indeed thought as the cloud centers in this superposition codebook and a;j'’s as 
the satellite codewords. Therefore, our multicodebook construction can be viewed as an equivalent way 
to generate a superposition codebook as in [331. This reveals that both the codebook structure and the 
decoding in our scheme are similar to that in the Han-Kobayashi scheme and therefore the two achievable 
rate regions are, not surprisingly, equal. 

However, note that for broadcast channels, combining Marton coding (which employs multicoding) [ JT7j | 
with Gelfand-Pinsker coding (which also employs multicoding) is more straightforward than combining 
superposition coding with Gelfand-Pinsker coding. The former has been shown to be optimal in some 
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cases [36J. Since our codebook construction for the interference channel also has the flavor of multicoding, 
extending this construction to setups where multicoding is required is also quite straightforward. As 
mentioned in the introduction, we exploit this to develop simple achievability schemes for more general 
setups described in later sections. 

Probability of Error: Due to the symmetry of the code, the average probability of error P(£) is equal to 
P(E\Mi, M 2 ), so we can assume (Mi,M 2 ) = (1,1) and analyze P(£|l, 1). Let (L lc , L lp , L 2c , L 2p ) denote 
the indices chosen during encoding by encoder 1 and encoder 2 . 

We now define events that cover the event of error in decoding message m\. 


51 = {(U?(hc),X?(l,h p )) £ Tf ] for all Z lc ,Z lp }, 

5 2 4 {(Uf(L lc ),X™(l, Li p ), U£(L 2c ), F”) i T} n) }, 

£3 - Z lp ), U2(L 2c ), Y?) e r} n) for some m 1 f 1, for some l lp }, 

£4 = {(U^(L lc ),X^(m 1 ,l lp ),Uf(l 2 c ),Yf) E T} n) for some m L f 1 , for some l lp ,l 2c }, 
£5 = X?{mi,h p ),U$(L 2 c ),Yf) e T} n) for some m 1 1 , for some Z lp ,Zi c }, 

£e - e T} n) for some m 1 7 ^ 1 , for some l lc ,l lp ,l 2c } 


Consider also the event £{, analogous to £ 1 , which is defined as follows. 

£[ = {(Uf(l 2c ),Xf(l,l 2p )) i for all l 2ci l 2p }. 


Since an error for mi occurs only if at least one of the above events occur, we use the union bound to 
get the following upper bound on the average probability of error in decoding m \: 


P(£i) + P(£() + P(£ 2 n St n E[ c ) + P(£ 3 ) + p(£ 4 ) + P(£ 5 ) + p(£ 6 ). 

By the mutual covering lemma [37, Chap. 8 ], P(£i) —)■ 0 as n —)■ cx) if 

R lp + Ri c >I(U l] X 1 ) + S(e'), 


where S(e') —> 0 as e' -> 0 . 

Similarly, we get that P(£{) —> 0 as n —>■ 00 if 

R 2p + R 2c > I(U 2 ; X 2 ) + S(e'). 

By the conditional typicality lemma, P (£ 2 fl £[' fi Ef) tends to zero as n oc. 

For P(£ 3 ) —>■ 0, we can use the packing lemma from [37, Ch. 3] to get the condition 

Ri + RipKliXi-U^U^Yj-Sie), 


where 5(e) —» 0 as e —» 0 . 

For P(£ 4 ) —> 0, we can again use the packing lemma to get the condition 


( 2 ) 

(3) 

(4) 


Ri + Rip + R2 c < I(Xi, u 2 - Ui, Yi) - 5(e). 


(5) 


For P(£ 5 ) -7 0, we apply the multivariate packing lemma from the Appendix as shown in ( f54[ ) to get 
the condition 


Ri + Rip + Ric < I(Ui\Xi) + I(Ui, Xi-U 2 , Fl) - 5(e). 


( 6 ) 


Finally, for P(£ 6 ) —^0 as n —>■ 00 , another application of the multivariate packing lemma as shown in 
gives the condition 


Ri + Rip + Ric + R 2 c < I(Ui, Xi) + I(U 2 ; Fi) + I(Ui,Xi, U 2 , F 4 ) - 5(e). 


(7) 





10 


A similar analysis leads to the following additional conditions for the probability of error in decoding 


m 2 to vanish as n —> oo. 

B ,2 + R 2p < I(x 2 - U 2 , Ui, Y 2 ) - 5(e), ( 8 ) 

R 2 + R 2p + R lc < I(X 2 , U x -U 2 , Y 2 ) - 5(e), (9) 

R 2 + R 2p + R 2c < I(U 2 ; X 2 ) + I{U 2 , X 2 \ U u Y 2 ) - 5(e), (10) 

R 2 + R 2p + R 2c + R u < I(U 2 ; X 2 ) + I(Ui] Y 2 ) + I(U 2 ,X 2 - U u Y 2 ) - 5(e). (11) 


Hence the probability of error vanishes as n —> oo if the conditions (| 2 |)-([TT|) are satisfied. For the sake 
of brevity, let us first denote the RHS of the conditions (| 2 |)-([TT|) by a, 6 , c, d, e, /, g,h,i, j respectively 
(ignoring the 5(e') and 5(e) terms). 

We then note the following relations among these terms which can be proved using the chain rule of 
mutual information, the Markov chains U\ — X\ — (U 2 . X 2 , Y], Y 2 ) and U 2 — X 2 — (Ui,Xi, Y], Y 2 ) and the 
independence of (Ui,Xi) and (U 2 ,X 2 ). 

e — a < minjc, d}, 
f -a<d< /, 
c < e < /, 

“ (12) 

i — b < minjiyf, h}, 

j - b<h< j, 

9 <i<j- 

We now employ Fourier-Motzkin elimination on the conditions (|2|)-([TT|) and R\ c , R\ p , R 2c , R 2p > 0 to 
eliminate R lc , R lp , R 2c , R 2p . The set of relations ( [12] ) can be used to simplify this task by recognizing 
redundant constraints. At the end, we get the following achievable region: 

R\ < e — a, 

R 2 < i — b, 

R\ -\- R 2 c T - j — cl — 5, 

R\ T R 2 "C (l h — cl — 5, (13) 

R\ + R 2 < ./ + g — cl — b , 

2 Ri + R 2 <c+h + f — 2a — b, 

R\ T 2R 2 d g j — a — 2b. 

Using the same facts as those used to prove ( fT 2 | ), we can show that the above region is the same as 
the Han-Kobayashi region. For the sake of completeness, we show this explicitly. 

• Consider the upper bound on Ri\ 

e — a — I{lh, X i; U 2 ,Y{) 

( =’/(x i; t/ 2 ,y,) 

=’/(A' i; y|C/ 2 ), (14) 

where step (a) follows since U\ — X 1 — (U 2 ,Yi) is a Markov chain, and step ( 6 ) follows since X 1 
is independent of U 2 . 

• Similarly, 


i-b = I{X 2 -Y 2 \U 1 ). 


( 15 ) 
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. Consider the first upper bound on the sum-rate c + j — a — b: 

c + j — a — b 

= /(X l5 Y, U 2 , Y) + /(Y; X 2 ) + I (Ur Y 2 ) 

+ I(U 2 , X 2 ; U u Y 2 ) - I(U 2 ; X 2 ) - /( Y; X,) 

= J(X i; U 2 , Yi\Ui) + I (Ur, Y 2 ) + J(Y, X 2 ; Y, Y 2 ) 

= J(X l5 U 2 , Y| Y) + I (Ur, Y 2 ) + J(X 2 ; Y, Y 2 ) 

= I(Xr, U 2 , Y 1 \U 1 ) + /(Y; Y) + I(X 2 , Y| Y) 

= I(Xr, Y| Y, Y) + J(X 2 , Ur, Y 2 ), (16) 

where step (a) follows by the chain rule of mutual information, step (6) follows by the Markov chain 
U 2 — X 2 — (U \, Y 2 ), step (c) follows since Y and X 2 are independent and step (d) follows by the 
independence of U 2 and (Y, Y)- 

• By similar steps, f + g — a — b = 

I(Xi, U 2 , Y) + J(X 2 ; Y 2 \U ls U 2 ). (17) 

. The remaining upper-bound on the sum-rate d + h — a — b can be simplified as follows: 

d + h — a — b 

= I(X U U 2 , Y, Yi) + I(X 2 , Ur, U 2 , Y 2 ) 

-/(Y;Y)-/(Y;X 2 ) 

= I(X 1 , Y; Y| Y) + I(X 2 , Ur, Y 2 \U 2 ), (18) 

which follows by the chain rule of mutual information and the independence of (Y, Y) and (U 2 , X 2 ). 

• The upper bound on 2Ri + Il> can be simplified as follows: 

c + h + f — 2a — b 

= J(Y; U \, U 2 , Y) + I(X 2 , Ur, U 2 , Y 2 ) + /(Y, Y) 

+ I(U 2 ; Yi) + /(Y, Y; U 2 , Yi) - 21 (Ur, X 1 ) - I(U 2 , X 2 ) 

= J(Y; U 2 , Y| Y) + I(X 2 , Ur, Y 2 \U 2 ) + I(U 2 ; Y) + J(Y, Y; U 2 , Y) 

= /(X i; Y, Y| Y) + /(X 2 , Ur Y 2 \U 2 ) + J(Y; Y) + /(X i; Y| Y) 

= /(x i; YI Y, u 2 ) + J(x 2 , Ur y 2 \u 2 ) + J(Y, Y; Y), (19) 

where step (a) holds by the chain rule of mutual information and the independence of Y and 
(U 2 ,X 2 ), step ( b ) follows by Y — Xi — (U 2 , Y) and the independence of X x and U 2 , and step (c) 
follows by the chain rule of mutual information and the independence of U 2 and (Y,Xi). 

• Finally, d + g + j — a — 2b can be similarly shown to be equal to 

J(X 2 ; Y 2 \Ui, U 2 ) + /(Xi, U 2 , Y| Y) + /(X 2 , Ur, Y 2 ). (20) 

From (|T3|)-([20|) and including a time-sharing random variable 0, we get that the following region is 
achievable: 

R! < I(X 1 ;Y 1 \U 2 ,Q), 

R 2 <I(X 2 ,Y 2 \U l ,Q), 

Ri + R 2 < i(Xr YI Y, u 2 ,Q) + J(x 2 , Y; YIQ), 

Y + R.2 < I{Xi, Y; Y|Y, Q) + /(X 2 , Y; Y|Y, Q), (21) 

Y + Y < /(X l5 Y; Y|Q) + /(X 2 ; Y|Y, Y, Q), 

2Y + Y</(X i; Y|Y,Y,Q) + /(x 2 ,Y;Y|Y,Q) + /(Y, Y; Y|Q), 

Y + 2Y < /(Y; Y| Y, Y, Q) + /(Y, Y; YIY, Q) + /(Y, Y; Y|Q), 
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for pmf p(q)p(ui,xi\q)p(u 2 ,x 2 \q). This region is identical to the region in (jT]). 


IV. State-dependent Interference channels 

In this section, we focus on the particular setup of the state-dependent Z-interference channel (S-D 
Z-IC) with noncausal state information at the interfering transmitter, as depicted in Fig. [2} We provide 
a simple achievability scheme for this setup, that is obtained from the alternative achievability scheme 
for the general interference channel. This scheme is shown to be optimal for the deterministic case. The 
auxiliary random variable used for encoding at the interfering transmitter now implicitly captures some 
part of the message as well as some part of the state sequence realization. The achievability scheme can 


also be viewed as a generalization of the schemes presented in [38] and 221. 

After characterizing the capacity region of the deterministic S-D Z-IC, we investigate a special case in 
detail: the modulo-additive S-D Z-IC. The modulo-additive channel is motivated by the linear deterministic 


model which has gained popularity over the recent years for studying wireless networks [18]. For this case 
(which can be thought of as a linear deterministic model with only one bit level), we obtain an explicit 
description of the capacity region and furthermore, show that the capacity region is also achieved by the 
standard Gelfand-Pinsker coding over the first link and treating interference as noise over the second link. 
Following this, the modulo-additive S-D Z-IC with multiple levels is considered and some discussion is 
provided about the capacity region and the performance of simple achievability schemes. 

To summarize, this section contains the following contributions: 

. An achievable rate region for the S-D Z-IC, 

• Capacity region of the injective deterministic S-D Z-IC, 

. Modulo-additive S-D Z-IC: optimality of treating interference-as-noise and other properties. 


A. Results for the State-Dependent Channel 

The following theorem provides an inner bound to the capacity region of the S-D Z-IC in Fig. [2} 
Theorem 2. A rate pair (f?i, R 2 ) is achievable for the channel in Fig. [5] if 

R^IfJ-Y^-KU-SlQ), 

R 2 < I(X 2 ;Y 2 \V,Q), 

R'2 < I(V, X 2 : Y 2 \Q) - I(V ; SjQ), (22) 

Ri + R 2 < I{u ; YfQ) + /(V, X 2 , Y 2 \Q ) 

-I(U;S\Q)-I(U,S-V\Q), 
for some pmf p(q)p(u,v\s, q)p(x\\u, v, s, q)p(x 2 \q). 

For the injective deterministic S-D Z-IC, we can identify natural choices for the auxiliary random 
variables in Theorem [2] that, in fact, yield the capacity region. This result is stated in the following 
theorem. 


Theorem 3. The capacity region of the injective deterministic S-D Z-IC in Fig. [3] is the set of rate pairs 
(Ri,R 2 ) that satisfy 

Ri^HiY^Q), 

R 2 — h(y 2 \t 1 ,q), 

R 2 <H(Y 2 \Q)-I(T 1 -S\Q ), 

Ri + R2 < H(Y 1 |Ti, 3, Q) + H(Y 2 \Q) - /(Tj; S|Q), 
for some pmf p(q)p(x\\s, q)p(x 2 \q), where \Q\ <4. 


Remark 1. Note that the capacity region remains unchanged even if the first receiver is provided with 


the state information. The proof of this theorem is presented in subsection IV-C 
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B. Proof of Theorem [2] 

Fix p(u,v\s)p(xi\u,v,s)p(x 2 ) and choose 0 < e' < e. 

Codebook Generation: 

• Encoder 2 generates 2 nR2 codewords x 2 (m 2 ), m 2 E [1 : 2 nR2 } i.i.d. according to p(x 2 ). 

. Encoder 1 generates 2 n ^ Rl+R d codewords u n (mi,li ) i.i.d. according to p(u), where m\ E [1 : 2 nRl ] 
and li G [1 : 2 nR 'f. Encoder 1 also generates 2 nR 2 codewords v n (l 2 ),l 2 £ [1 : 2 nR 2] i.i.d. according 
to p(v). 

Encoding: 

• To transmit message m 2 , encoder 2 transmits x 2 (m 2 ) . 

• Assume that the message to be transmitted by encoder 1 is m\. After observing s n , it finds a 
pair (Zi, Z 2 ) such that (u n (m^ If), v n (l 2 ), s n ) E . Then it transmits xf which is generated i.i.d. 
according to p(xi\u,v,s). 

Decoding: 

• Decoder 1 finds a unique rh\ such that Zi), yf) E T} n) for some l\. 

• Decoder 2 finds a unique in 2 such that (x 2 (rh 2 ),v n (l 2 ),y 2 ) E T} n) for some l 2 . 

Probability of Error: Due to the symmetry of the code, the average probability of error P(£) is equal 
to P(£|Mi, M 2 ), so we can assume (Mi, M 2 ) = (1,1) and analyze P(£T| 1 , 1 ). Let (Li, L 2 ) denote the pair 
of indices chosen by encoder 1 such that (U n (l, Lf), V n (L 2 ), S n ) E Tf. 

We now define events that cover the error event: 

s 1 = {(U n (l,h),V n (l 2 ),S n ) i Tj n) for all Z 1 ? Z 2 }, 
£ 2 ±{(U n (l,L 1 ),Y 1 n )fT e ^}, 

£3 — {{U n {m-L,lf),Yf) E for some rn \ 1, Zi}, 

£ 5 4 {(Xf(m 2 ),V n (l 2 ),Y 2 n ) E T} n) for some m 2 f 1 ,l 2 }. 

Since an error occurs only if at least one of the above events occur, we have the following upper bound 
on the average probability of error: 

P(£) < P(£0 + P (£2 n £f) + P(£ 3 ) + P(^ 4 n £f) + P(f 5 ). 

Similar to the proof of the mutual covering lemma [37, Ch. 8 ], we can show that P(£i) —y 0 as n —> 00 
if 


R[ > I(U; S) + 5(e'), 

R' 2 >I(V- S) + 5(e'), 

R[ + R' 2 > I(U ; S) + I(U , V) + <J(e'), 


(24) 

(25) 

(26) 


where 5(d) —* 0 as e' -* 0. 

By the conditional typicality lemma [37, Ch. 2], P(£ 2 n£f) and P(£ 4 n£[ ) both tend to zero as n —y 00 . 
By the packing lemma [37, Ch. 3], for P(£ 3 ) —> 0, we require 


and for P(£ 5 ) —y 0, we require 


R l + R! l <I(U-Y 1 )-5(e), 

R 2 < I(X 2 ,Y 2 \V) - 5(e), 
R 2 + R' 2 <I(V,X 2 -Y 2 )-5(e ), 


(27) 

(28) 
(29) 


where 5(e) —> 0 as e —>■ 0. Hence, P(£) —> 0 if d24] ), ( [25] ), ( [26] ), ((27]), ( [28] ), ( |29] ) are satisfied. Allowing 
coded-time sharing with a time-sharing random variable Q and eliminating R ,\, R' 2 via Fourier-Motzkin 
elimination, we obtain the region 
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C. Proof of Theorem [i] 

Achievability follows from Theorem [2] by choosing U = Y\ and V = T\. These choices are valid since 
encoder 1 knows (Mi, S n ), which determines T” and Yf. We now prove the converse. 

Given a sequence of codes that achieves reliable communication (i.e. P Y —> 0 as n —> oo) at rates 
(Ri,R 2 ), we have, by Fano’s inequality: 

HiMfYf) < ne n , 

H(M 2 \Yf ) < ne n , 

where e n —» 0 as n —> oo. 

Using these, we can establish an upper bound on P x as follows, 

nR 1 = H(M 1 ) 

= H(M 1 \S n ) 

< I {Mi] Yf\S n ) + ne n 
— H(Yf\S n ) + ne n 

n 

<Y H i Y u\Si) + ne n . 

Z— 1 

A simple upper bound on If is established in the following: 

nR 2 = H[M 2 ) 

= H{M 2 \Tf) 

<I(M 2] Y 2 n \T?)+ne n 

<H(Yf\T?)+ne n 

n 

Y H(Y 2i \Tu) + ne n . 

Z— 1 

For the second upper bound on R 2 , consider the following: 

nR 2 = H(M 2 ) 

= H{M 2 ) + H(Y£\M 2 ) - H(Y 2 n \M 2 ) 

= p(y 2 n ) + p(M 2 |y 2 n ) - P(y 2 "|M 2 ) 

n 

< Y H{Y 2i ) + ne, - H{Yf\M 2 ) 

i=l 

n 

= ^ H(Y 2i ) + ne n - P(Tf |M 2 ) 

i=l 

n 

= Y H ( Y ^+ ne n-H(Tf) 

i=l 

n 

< Y H ( Y '2i) + ne n -I(Tf : S n ) 

i=l 

n 

= Y H ( y 2i) + ne n - H(S n ) + H(T?\S n ) 

i=l 

n n 

< Y H ( Y 2 i) + ne n - H(S n ) + Y H(T n \Sf 
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= Y H{Y 2i ) + ne n - Y H(S i ) + Y H^Si) 

2 = 1 2—1 2=1 

n n 

= Y H(Y 2i ) + ne n - ]T /(T H ; 5*) 

2=1 2=1 

(30) 

where step (a) follows by the injectivity property, step ( b ) follows because T" is independent of M 2 , and 
step (c) follows because S n is an i.i.d. sequence. 

We now establish an upper bound on the sum-rate. 

n(R i + R 2 ) = H(Mi\S n ) + H(M 2 ) 

< I (Mi, Tf, Y"|5 n ) + ne n + tf(Y 2 n ) + tf(M 2 |Y 2 ") - tf(Kf|M 2 ) 

< I (Mi, T",Y"|5") + ne n + (Y 2 n ) + ne n - H(Y 2 n \M 2 ) 

< H(T™, 17*|5 n ) + tf(Y 2 n ) - f/(T"|M 2 ) + 2ne n 

= (Tj\ 17(5") + H(Y 2 ) - H(T 7) + 2ne n 
= i7(17|5 n , Tj 1 ) + H(Y 2 n ) - /(Tj 1 ; S n ) + 2ne n 

(c) n n n 

< ^ T u ) + ^ H(Y 2i ) - ^ /(T u ; 5,) + 2ne„ 

2=1 2=1 2=1 

where as before, steps (a), (6) and (c) follow because of injectivity property, independence of Tj" and 
M 2 , and i.i.d. state respectively. 

From the four bounds established in this section, we can complete the converse by introducing an 
independent time-sharing random variable Q uniformly distributed on [1 : n] and defining X], Tj, S, X 2 , 
Y ], Y 2 to be Xi Q , Ti Q , Sq, X 2Q , Y iq , Y 2Q respectively. ■ 

D. Example: Modulo-Additive State-Dependent Z-Interference Channel 

Theorem 4. The capacity region of the modulo-additive S-D Z-IC in Fig. [4] is given by the convex closure 
of the rate pairs (Ri,R 2 ) satisfying 

fli<(l-A)log|*| + A#(p), 

R? < log \X\ — H (Ap + (1 — A)<5 0 ), ' ' 

for some p e Vx , where Vx denotes the probability simplex corresponding to X , H(p) stands for the 
entropy of the pmf p and <5 0 denotes the pmf that has unit mass at 0. 

The capacity region when X = (0,1} and S is i.i.d. Ber(|) is shown in Figure |7j 
Proof of Theorem [?] 

Consider the capacity region stated in Theorem [3] Let p L0 , p x l and p 2 , all in Vx- be used to denote 
the pmf’s p(xi\s = 0 ,q), p(xi\s = 1 ,q) and p(x 2 \q) respectively. Evaluating each of the constraints in 
( |23| ) gives us the following expression for the capacity region: 

Ri<(l-X)H(pi fi ) + XH(pix), 

R 2 < H(p 2 ), 

R 2 <H ((1 - A )p 2 + A p) + A H(pi,i) 

— H (X P i,i + (1 — A)5 0 ), 

Ri + R 2 < (1 - A )H(pi t0 ) +H(( 1 - A )p 2 + A p) 

+ AFf(pi,i) — H (Xp\ t i + (1 — A)5q) , 


( 32 ) 
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Fig. 7: Capacity Region with X = {0,1} and S i.i.d. Ber(^) . The dotted line shows the capacity region when all nodes have state 
information. Note that the maximal sum-rate of 1.5 bits/channel use is achievable with state information only at the interfering Tx. 


where p e Vx is a pmf that is defined as 

l*|-i 

p(k) = ^2 Pi,i{i)P2{k -i), 0 < k <\X\- 1, 

i=0 


and k — i should be understood to be (k — i) mod \X\. 

Firstly, we note that pi >0 should be chosen as the pmf of the uniform distribution to maximize II 
thus maximizing the RHS of the constraints in ( |32] ). Similarly, p 2 should also be chosen to be the pmf 
of the uniform distribution. Then, we can also remove the first constraint on R 2 , since it is rendered 
redundant by the other constraint on R 2 . Thus, the capacity region is given by the convex closure of 
(Ri,R 2 ) satisfying 

Ri < (l-A) log(|*|) + A. H(p ltl ), 

R 2 < log(\X\) + A H(p 1}1 ) - H (A p 1A + (1 - X)6 0 ), 

R l + R 2 <(2~\)log(\X\) + \H(p 1 , 1 ) 1 j 

— H (Api,! + (1 — A)S 0 ) , 


for p 1A e V x - 

For any p, the region in ( |3T[ ) is contained in the region in ( |33j ) for p ] A = p. Hence, the convex closure 
of ( pTj ) is contained in the convex closure of ( |33| ). 

However, also note that the region in ( |33| ) for any p\ A is contained in the convex hull of two regions, one 
obtained by setting p = pi,i in ( |3T| ) and the other obtained by setting p = in ( pTj ). Hence, the convex 
closure of ( |33| ) is also contained in the convex closure of ([37]). This concludes the proof of Theorem [4]B 


Remark 2. The optimal sum-rate (2 — A) log \ X\ is achieved by choosing p = Sq. This corresponds to 
setting the transmitted symbols of the first transmitter to 0 when S' = 1 so that it does not interfere 
with the second transmission. The first transmitter then treats these symbols as stuck to 0 and performs 
Gelfand-Pinsker coding. The second transmitter transmits at rate log(|T’|) bits/channel use. It can be easily 
verified that this is also the optimal sum-rate when all nodes are provided with the state information. Thus, 
for this channel, the sum-capacity when all nodes have state information is the same as that when only 
encoder 1 has state information. 


Remark 3. Finally, we note that there is also another way to achieve the capacity region of the modulo 
additive S-D Z-IC. For this, first recall that to get the capacity region expression in Theorem [3] we set the 
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auxiliary random variables U and V in the expression in Theorem [2] to Y\ and Ti respectively. Another 
choice, which corresponds to standard Gelfand-Pinsker coding for the first transmitter-receiver pair and 
treating interference as noise at the second receiver is to choose V = </> in Theorem |2j This gives us the 
following achievable region: 

Ri < I(X 2 ;Y 2 \Q), ' ’ 

for some pmf p(q)p(u\s, q)p(xi\u, s, q)p(x 2 \q). We can now see that for the modulo-additive S-D Z-IC, 
the capacity region is also achieved by making the following choices in the above region: p{u\s = 0) to 
be the uniform pmf over X, p(u\s = 1) to be p, p(xi\u,s) to be S u (i.e. X\ = U ) and p(x 2 ) to be the 
uniform pmf over X. Thus, the capacity region of the modulo-additive S-D Z-IC can also be achieved by 
treating interference as noise at the second receiver. 


E. Multiple-level modulo-additive S-D Z-IC 

The linear deterministic model introduced in [ 181 consists of multiple bit levels that roughly correspond 
to bits communicated at different power levels. The modulo-additive S-D Z-IC that we looked at in the 
previous subsection is a special case in which the number of levels is one. Extending the model to have 
multiple bit levels raises some interesting questions which we consider in this subsection. 

More specifically, consider the model depicted in Fig. [8j which can be thought of as three copies of 
the model in Fig. [4j which are however related by the common state affecting them. For simplicity, we 
restrict attention to the case when the alphabet on each level, denoted by X, is the binary alphabet, i.e. 
{0,1}, and the state is Ber(0.5). Fet L denote the number of bit levels. 


Mi 


M 2 


gn 



Mi 


M 2 


Fig. 8: The Modulo-Additive S-D Z-IC wit multiple bit levels. 


This model also falls under the injective-deterministic setup for which we have completely characterized 
the capacity region. So the capacity region can be easily computed, as we indeed do in the following. 
This evaluation also allows us to immediately compare the capacity region with the rates achieved by 
some straightforward achievability schemes that we can employ. In particular, consider the following two 
simple achievability schemes: 

• “Separation”: The simplest strategy one can employ is to separately consider each level and commu¬ 
nicate over it independently of the other levels. This gives us that the rate pairs (-Ri,-R 2 ) satisfying 


A < \ + E 

i =1 
L 

R*<L-Y,H{pi + 8 o), 

1=1 


( 35 ) 


for some p 1 ,p 2 , • • • ,Pl € 'Px are achievable. 
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Fig. 9: Comparison of the different rate regions for 2-level binary modulo-additive S-D Z-IC 



Fig. 10: Comparison of the different rate regions for 3-level binary modulo-additive S-D Z-IC 


• “Communicate state”: Alternatively, by noticing that strictly better rates could have been achieved 
if decoder 2 also had access to the state information, we can reserve one level to communicate the 
state from encoder 1 to decoder 2. This is done by ensuring that encoder 1 transmits a 1 on this 
reserved level whenever the state is 1, and encoder 2 constantly transmits a 0 on this level. The nodes 
communicate on the remaining levels keeping in mind that now decoder 2 also has state information. 
Note that while no communication can happen between encoder 2 and decoder 2 on the reserved 
level, encoder 1 can still communicate with decoder 1 at rate 0.5 on this level by treating it as a 
channel with stuck bits (bit equals 1 whenever state equals 1). This strategy provides us the following 
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achievable region: 

R i< 2 + 2 ^^’ 

R 2 < L — 1 - (p), 


( 36 ) 


for some p e V x l-i. 

We can expect that the suboptimality of reserving one level for communicating the state should become 
relatively small as the number of levels increases i.e. at high SNR. This is corroborated by the numerical 
analysis, shown in Figs. [9] and [TO} in which we can see that there is a marked improvement in the rates 
achieved by this scheme relative to the capacity region as we increase the number of levels from 2 to 3. 
Indeed, since all the levels are affected by the same state, the entropy of the state becomes small compared 
to the communication rates as the SNR increases, so it is not a big overhead to explicitly communicate 
the state to decoder 2 at high SNR. However, at low SNR, the figures show that the overhead incurred 
is quite high due to which this approach is significantly suboptimal, while the simple scheme of treating 
the levels separately results in achieving very close to the entire capacity region. 


V. Interference Channels with Partial Cribbing 

In this section, we focus on deterministic Z-interference channels when the interfering transmitter can 
overhear the signal transmitted by the other transmitter after it passes through some channel. This channel 


is also modeled as a deterministic channel, dubbed as partial cribbing in [311. Deterministic models, in 


particular linear deterministic models [18], have gained popularity due to the observation that they are 
simpler to analyze and are provably close in performance to Gaussian models. 

There have been quite a few very sophisticated achievability schemes designed for interference channels 
with causal cribbing encoders, however optimality of the achievable rate regions has not been addressed. 


In the most general interference channel model with causal cribbing [29], each encoder needs to split 
its message into four parts: a common part to be sent cooperatively, a common part to be sent non- 
cooperatively, a private part to be sent cooperatively and a private part to be sent non-cooperatively. 
Further, because of the causal nature of cribbing, achievability schemes usually involve block-Markov 
coding, so that each encoder also needs to consider the cooperative messages of both encoders from the 
previous block. Motivated by the alternative achievability scheme we have presented earlier for the general 
interference channel, we present a simple optimal achievability scheme that minimizes the rate-splitting 
that is required. Specifically, while encoder 2 only splits its message into a cooperative and non-cooperative 
private part, encoder 1 does not perform any rate-splitting at all. By focusing on the specific configuration 
of the Z-interference channel, we are able to prove the optimality of an achievability scheme that is 
simpler than the highly involved achievability schemes for the general case that are currently known. 


A. Result for Partial Cribbing 

Theorem 5. The capacity region of the injective deterministic Z-interference channel with unidirectional 
partial cribbing, depicted in Fig. [5j is given by the convex closure of (If. If) satisfying 

Ri<H(Y l \W), 

R -2 < min (h(Y 2 ),H(Y 2 ,Z 2 \T 1 ,W)'), (37) 

Ri + R 2< H{Y x \T u w ) + min (h{Y 2 ), H(Y 2 , Z 2 \W )), 

for p(w)p(x 1 \w)p(x 2 \w) , where W is an auxiliary random variable whose cardinality can be bounded as 
|W| < 1341 + 3. 

The proof of this theorem is presented below. 
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B. Proof of Theorem [5] 

Achievability 

Choose a pmf p(w)p(ud,u c , xi\w)p(x 2 , z 2 \w) and 0 < e' < e, where for the sake of generality, we use the 
auxiliary random variables Ud and U c . In the injective deterministic case at hand, they can be set to Yj 
and Xj respectively. 

Codebook Generation: The communication time is divided into B blocks, each containing n channel 
uses, and an independent random code is generated for each block b £ [1 : B], Whenever it is clear from 
the context, we suppress the dependence of codewords on b to keep the notation simple. The messages 
in block B are fixed apriori, so a total of B — 1 messages are communicated over the B blocks. The 
resulting rate loss can be made as negligible as desired by choosing a sufficiently large B. 

We split R 2 as R' 2 + Rif which corresponds to the split of message 2 into two parts, one that will be 
sent cooperatively by both transmitters to receiver 2 and the other non-cooperatively only by transmitter 2 
to receiver 2. For each block b, let m' 2 b £ [1 : 2 nR f and m 2b £ [1 : 2 ni?2 ]. For each block b £ [1 : B], we 
generate 2 nR ' 2 sequences w n i.i.d. according to p(iu). 

• For each w n in block b, we generate 2 nR 2 sequences [z 2 (w n , m 2 6 )} i.i.d. according to p(z 2 \w). Then 
for each (w n ,z 2 ), we generate 2 nR 'i sequences [x 2 (w n , z 2 , m 2 6 )} i.i.d. according to p(x 2 \z 2 ,w). 

• For each w n in block b, we generate 2 nRc sequences {u^(w n ,l c )} i.i.d. according to p(u c \w), where 

l c £ [1 : 2 nRc ]. We also generate 2 n( ' Rl+Rd '> sequences Id)} i.i.d. according to p(u d ), where 

m 1)b £ [1 : 2 nRl ] and l d £ [1 : 2 nRd }. [] 

Encoding: Let us assume for now that as a result of the cribbing, encoder 1 knows m 2 b _ 1 at the end 
of block 6 — 1. Then in block 6, both encoders can encode rnl 2 b _ , using v: n (rnl 2 b _ ,) where w n is from 
the code for block 6. 

. To transmit message encoder 1 finds a pair (l d , If] such that 


(w n (m' 


2 , 6—1 / 5 




(n) 


It transmits x 2 that is generated i.i.d. according to p(xi\w,Ud,u c ). 

• To transmit message rn 2 }, = ( m' 2 b ,m 2b ), encoder 2 encodes m' 2b as z 2 (w n ,m 2b ) and then transmits 
x 2 (w n ,zf m" b ). 


We fix apriori the messages in block B to be m | j-> = 1, m 2 B = 1 and m 2 B = 1. Also, to avoid mentioning 
edge cases explicitly, whenever m 10 , m' 20 or m 20 appear, we assume that all are fixed to 1 . 

Decoding: 

• Encoder 1: At the end of block 6 , assuming it has already decoded m 2 h _, at the end of block 6—1, 
encoder 1 decodes m 2 b by finding the unique rh 2 b such that the sequence z 2 it has observed via 


°2,b 

cribbing is equal to z 2 (w n ,m 2b ). 


Decoder 1: In each block 6 , decoder 1 finds the unique such that (u%(m ltb ,l d ),yi) e Te (n) for 
some l d . 

Decoder 2: Decoder 2 performs backward decoding as follows: 


- In block B, decoder 2 finds a unique m 2 B _ i such that the condition ([38]) is satisfied for some 

lr. 


4,1). e 7? n) 


(38) 


- In block 6 , assuming m 2b has been decoded correctly, it finds the unique (rnl 2b ^ such 
that the condition ( |39| ) is satisfied for some l c . 


w n (m: 


2 , 6 - 1 ) 5 Z 2 


w n , m 2 b ) 


xZ(wfzfm"),u n c (w n ,l c ),yZ) £ 


(39) 


4 Note that the u% sequences are generated independently of the w n sequences. 
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Probability of Error: To get a vanishing probability of error, we can impose the conditions 
in the following list. 

Ch. 8], we can show that the 


Similar to the proof of the mutual covering lemma [37 


conditions are sufficient for the success of encoding at the first transmitter: 


described 

following 


R d > I{U d \W)+ &{<!), (40) 

R d + R c >I(U d -,U c ,W)+5(e'). (41) 

. For the decoding at encoder 1 to succeed: 

R’ 2 < H(Z 2 \W) -5(e). (42) 

• For decoding at decoder 1 to succeed: 

i2 1 + i2 d </(£7 d ;ri)-<J(e). (43) 


. For the backward decoding at decoder 2 to succeed, it is sufficient that the following conditions are 
satisfied: 


R 2 < I{X 2 - Y 2 \W, U c , Z 2 ) - 5(e), (44) 

K + Rc< I(U C ,X 2 ; Y 2 \W, Z 2 ) - 5(e), (45) 

R' 2 + K + R C < I(W, U c , X 2 - Y 2 ) - 5(e) . (46) 


Noting that R' 2 + R! 2 
and substituting Ud = 
bound on R \: 


= R' 2 , eliminating (77,/. R c , R 2 , R”) from 
Y\ and U c — T), we get the achievable region in ([37]) with the following additional 


via Fourier-Motzkin elimination, 


R x < HiY^Tf) + H(Y 2 \W,Z 2 ). 


To conclude the proof of achievability, we show that this bound is rendered redundant by Ri < H(Y\\W) 
which can be proved by the following chain of inequalities: 


H(Y 1 \W,T 1 ) + H{Y 2 \W,Z 2 ) > H(Y 1 \W,T 1 ) + H(Y 2 \W,X 2 ) 

= H(Y 1 \W,T 1 ) + H(T 1 \W,X 2 ) 
= H{Y 1 \W,T 1 ) + H{T 1 \W) 

= H(Y 1 ,T 1 \W) 

> 77(Yi|VF). 


Converse 

We now establish the converse. By Fano’s inequality, we have the following two relations that are satisfied 
by any sequence of codes that achieve reliable communication: 

H(M 1 117) < ne n , H(M 2 \Y 2 n ) < ne n , 

where e n —>■ 0 as n —> oo. 

First, an upper bound on Ri is established in ( |47| ). 

nR l = H(Mf) 

= H(Mi\Z 2 ) 

< I (Mi; Yf\Z 2 ) + ne n 

< H(Yf\Z 2 ) + ne n 


2—1 


= Y,H{Y li \W i ) + ne n 
2—1 


( 47 ) 




where W* = Z 2 1 . 

Next, we establish two bounds on R 2 , the first one in ([48]) as follows: 
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and the second one in (49) below: 


nR 2 = H{M 2 ) 

<I(M 2 ,Y 2 n )+ne n 
— H(Y 2 ) + ne n 

Tl 

< ^ H(Y 2i ) + 7ie n , 

i— 1 


nR 2 = H{M 2 \M 1 ) 

= H(M 2 , Z 2 \Mi) 

= H(Z 2 \Mi) + H(M 2 \M u Z 2 n ) 

= H(Z 2 \Mx) + H(M 2 \M U Z 2 n ,T”) 

< H{Zl\M x ) + I(M 2 , r 2 n |M l5 Z”, T") + ne„ 

< ^(^) + ^(y 2 n |^,7r)+ne n 


< ^ ^ H(Y 2i \W u T u , Z 2t ) + ne n , 

i= 1 2=1 

?2 n 

= 5^ ^(^ 2 i| Wi, Tii) + ^ | Wi, T u , Z 2j ) + ne n , 

2=1 2=1 


(48) 


— ' H(Y 2 j : Z 2 j\Wj,T\j) + ne n , 

2=1 


(49) 


where step (a) follows because T," is a function of A"[' which is a function of (M { . Z 2 ), and step (b) 


follows because Z 2i — W t — T u . 


Finally, we establish two bounds on the sum-rate, the first one in (50) below: 


n(Ri + R 2 ) 

= H{M X \Z™) + H{M 2 , Z”) 

= H(Mi, T™\Z 2 ) + H(Z%) + H(M 2 \Z 2 ) 

= H(T?\Z%) + H(Mi\T™, Z 2 n ) + H(Z%) + H(M 2 \Z%) 

< H{Y 2 \X 2 , Z 2 ) + I (Mi, Y?\T?, Zl 2 ") + H(Z%) + I(M 2 , Y? |Z”) + ne n 

< #071*2, ^ 2 n ) + #07W, Z 2 ) + H(Z 2 ) + H(Y?\Z%) - H(Y 2 \M 2 , Z 2 ) + ne n 

= H(Y 2 \X 2 , Z 2 ) + Z 2 ) + H(Z%) + H(Y 2 \Z 2 ) - H(Y 2 n \M 2 , X 2 , Z”) + ne 

= H(Y?\X?, Z 2 ) + H(Y?\T?, Z 2 n ) + H(Z 2 ) + H(Y 2 n \Z%) - H(Y 2 n \X%, Z?) + ne n 
= H(Y?\T T, Z 2 ) + H(Z 2 ) + H(Y?\Z%) + ne n 
= H(Y?\T [*, Z 2 ) + H(Y 2 n , Z 2 n ) + ne n 


< ^ ^(^1^, Wi) + ^ i?(r 2i , Z 2i |Wi) + ne n , 


( 50 ) 
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where step (a) uses the fact that H(T™\Z%) < H(Y™\X%, Z 2 ), which is proved below: 

H(T?\Z%) < H(T") 

= H(Y 2 n \X%) 

= H(Y?\XZ,Z2) 1 


and step (b) follows because M 2 — (X 2 , Z 2 


-V! 


The second bound on the sum-rate is established in (51) as follows: 


n(R\ + R 2 ) 

= H(Mi, T™\Z 2 ) + H(M 2 ) 

< H(T[ l \Z%) + H(M 1 \T^, Z%) + I (M2] Y 2 ) + ne n 

< H(Y 2 \X 2 , Z 2 ") + I (Mi, Y?\T?, Z n 2 ) + H(Y 2 n ) - H(Y 2 n \M 2 ) + ne n 

< H{Y 2 \X 2 , Z%) + H(Y?\T?, Z%) + H(Y?) - H(Y?\X%, Z%) + ne n 
— H(Y™\T™, Z 2 ) + H(Y 2 ) + ne n 


< h ( y ^\ t ^ w i)+ H{ y^ + ne - ^ 51) 

2—1 2—1 

In (g7)-(|5T), we can introduce a time-sharing random variable Q uniformly distributed on [1 : n\. 
Defining W to be (Wq,Q) and (X 1 , X 2 ,Y 1: Y 2 ) to be (X 1Q , X 2Q ,Y 1Q ,Y 2Q ), we get the required bounds 
on the rates. 

We also require the Markov relationship X\ — W — X 2 to be satisfied. Since W t is chosen to be Z'if\ 
we immediately have X u — W, — X 2i and hence X 1 — W — X 2 . Finally, the bound on the cardinality of 
W can be established using the standard convex cover method. ■ 


VI. Extensions 


A. State-Dependent Z-channel 


The result in Theorem [3] can be extended easily to Z-channels in which transmitter 1 also wishes to 
communicate a message M 2 \ at rate R 2 \ to receiver 2, as shown in Fig. [IT 


Mi , M 2 i 


M 2 


gn 


1 



Mi 


M 2 , M 21 


Fig. 11: The Injective Deterministic S-D Z-C 
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Theorem 6. The capacity region of the injective deterministic state-dependent Z-channel is the set of rate 
pairs (Ri, R 2 i, R 2 ) that satisfy 


Ri <H(Y 1 \S,Q), 

R2<H{Y 2 \T 1 ,Q), 

R2i<H(T 1 \S,Q), 

Ri + R21<H(T 1 ,Y 1 \S,Q), 

R2 + R21<H(Y 2 \Q)~I(T 1 -S\Q), 

R 1 + R 2 + R'2 1 < H(Y 1 |Ti, S, Q ) + if (y 2 |Q) - /(7i; S|Q), 


for some pmf p{q)p(xi\s,q)p(x 2 \q), where \Q\ < 6 . 


The achievability scheme for this case is similar to the achievability scheme for the Z-IC described in 
Section IV-C| except that now the v n sequences at encoder 1 are also binned, and this bin-index corresponds 
to the message M 2 1 , that is to be communicated from transmitter 1 to receiver 2. The converse can be 
established by following similar steps as the converse for the Z-IC. 


Remark 4. Since broadcast channel and multiple-access channel are special cases of the Z-channel, 
Theorem [ 6 ] also provides the capacity region for the deterministic broadcast channel and the injective 
deterministic multiple-access channel. 


B. State-dependent injective deterministic Z-IC with unidirectional partial cribbing 


Mi 


M 2 


S n 


1 






Fig. 12: State-Dependent Injective Deterministic Z-Interference Channel with Unidirectional Partial Cribbing 


To illustrate further the advantages of the multicoding scheme, we consider a model that combines 
the state-dependent Z-IC and the Z-IC with unidirectional partial cribbing, as depicted in Fig. [12 
can combine the achievability schemes for the two component setups from Sections |IV-C and |V-B 


straightforward manner to get an achievability scheme for this setup. It turns out that this is capacity- 
achieving, resulting in the following theorem. 


Theorem 7. The capacity region of the channel in Fig. 12 is given by the convex closure of rate pairs 
(Ri,R 2 ) satisfying 


R^HiYWW.S) 

R 2 <H(Y 2 ,Z 2 \T 1 ,W) 

R '2 < min (H(Y 2 ), H(Y 2 , Z 2 \W)) - /(T i; SjW) 

Ri + R 2 < H(Yi\W, Tj, S) + min (H(Y 2 ), H(Y 2 , Z 2 \W)) - J(T i; S\W) 
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for pmf of the form p(w)p(x 2 \w)p(xi\w, s ). 

The proof of the converse combines ideas from the converse proofs we have presented for the state-only 
case and the cribbing-only case. 


VII. Discussion 

Note that there is one difference between the multicoding-based achievability scheme for the canonical 
interference channel and the achievability schemes for the settings in Sections [IV] and [V| For the former, 
the codebook associated with the auxiliary random variable U is used at both receivers during decoding, 
whereas for the cribbing setup, the codebook associated with auxiliary random variable U d is only used 
at the desired receiver, while that associated with the auxiliary random variable U c is only used at the 
undesired receiver. One way to understand this dichotomy is to observe similarities with the inner bound 
for broadcast channel which combines Marton coding and superposition coding, given in [ [37] Proposition 
8.1] which involves three auxiliary random variables U 0 , U\, U 2 . Here, the random variable U 0 is used at 
both receivers during decoding, while U\ and U 2 are used only at the respective receivers. Now for the 
deterministic cribbing setup that we have considered, we can think that in the optimal scheme, Uq can be 
set to be the empty random variable 0, and U\ and U 2 correspond to U d and U c respectively, i.e., there is 
no superposition coding, only Marton coding is used (with the distinction from usual Marton coding that 
the set of U™ sequences is not binned). The situation is similar for deterministic and semideterministic 
broadcast channels where Uq = 0 is optimal too. On the other hand, the Han-Kobayashi scheme employing 
superposition coding can be thought of as setting U 2 to 0, and U 0 and U\ correspond to U and X 
respectively (no Marton coding, only superposition coding). The key observation is that in the Han- 
Kobayashi scheme, it is not necessary to think of U as encoding a part of the message explicitly, which 
can be exploited to view the superposition coding instead in a manner resembling Marton coding, as we 
have shown in Section [TTl] (again with the distinction from usual Marton coding that the set of sequences 
corresponding to the auxiliary random variable is not binned). This clarifies the dichotomy mentioned 
at the beginning of this paragraph. Alternatively, we can understand both ways of decoding in a unified 
manner, as in [39]. 
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Appendix 

Multivariate Packing Lemma 

Lemma 1. Consider the following four assumptions. 

(A) Let {U,Xi,X 2 ,. .. ,X k ,Y} be random variables that have some joint distribution pnx t ,x 2 ,-,x k ,y- 

(B) Let ( U n ,Y n ) ~ p(u n ,y n ) be a pair of arbitrarily distributed sequences. 

(C) For each j G [1 : K ], let {A0'(m ? ), rrij e Aj}, where \Aj\ < 2 nR \ be random sequences each 
distributed according to n”=i 

(D) For each j e [1 : K] and each rrij, assume that X-' (m. ; ) is pairwise conditionally independent 
of (..., X?_ 1 (m j -i),X? +1 (m j+1 ),..., Y n ^j given U n for all (... , rrij- 1 , m J+ i,...), but arbitrarily 
dependent on other AJ(-) sequences. 

Then there exists 6(e) that tends to zero as e —> 0 such that 

P((iT,X?(m 1 ),X2(m 2 ),...,XZ(m K ),Y n ) e T} n) 
for some mi e Ai, m 2 G A 2 ,, m K € Ak) 
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tends to 0 as n —» oo if 


K 


K 


Rj < Y H ( X j\ U ) - H ( X 1 . X 2 , • • •, A* |C7, y) - 5(e) 


j =i i =1 


(52) 


Proof: The proof follows on similar lines as that of the packing lemma in [37 


Ch. 3]. 


Consider a fixed tuple (mi, m 2 , ■ ■ •, m j <)• The chain of inequalities resulting in ( [53] ) bounds the prob¬ 
ability of [U n , {Xf(rhj)}f ;=1 ,Y n ) being jointly typical, where (a), ( b ) and (c) follow from assumptions 


3 V"JJi3= 

(C) and (D). Then we can apply the union bound over all possible tuples (%,%, • • • ,fh K ) to get the 


condition (52). 


p ( 


(™i)}J=l, 

g 7; (n) ) 


= 

E 




[u n ,y n )&T^ n) 




(a) 

E 

p(fi B ,J/ B )P((fi B ,^(m 1 ),^(m a ) ) . 

••,X£(m*)J n ) G 


(u n ,y n )eTe n 




(6) 

E 


p{x 1 , ■ 





T e (n) (x 1 ,x: 2 ,...,x if |s’*,^) 


(y) 



I\ 


E 

p(fi n ,iT) 

y 

"I*”) 


(u n ,y n )er} n 


T e (n) (x:i,.Y2,...,.YK|w n ,y n ) J=1 

I< 

< 

E 

P (ufy n ) 




(u n ,y n )£T e (n) 


I< 

3 = 1 

< 

E 

p(u n , y n ) 2 n ( H ( x Y-XK\ u , Y )+s(e)) JJ 2 - 

n(r/(Xi|C/)-5(e)) 


(u n ,y n )eT e (n) 


3 = 1 





K 


= 

2 n(H(X 1 ,...,X K IU,Y)+S(e)) 2 ~ n ( H ( X 3 . 





3= 1 



U n = u n , Y n = y n 


U n = u n 


(53) 


Example 1 

For the case U 


0 and random variables X 1; X 2 ,Y, the condition ( [52] ) can be expressed as follows: 


Ri + R 2 < H(X 1 ) + H(X 2 ) - H(X U X 2 \Y) - 5(e) 
= I(Xi] X 2 ) + I{X U X 2 ; Y) - 5(e). 


(54) 


Example 2 
For the case U 


and random variables Xi,X 2 , X 3 , Y, the condition (52) can be expressed as follows: 


R 1 + R 2 + R 3 < H{Xf) + H(X 2 ) + H(X 3 ) 

— H(Xi,X 2 , X 3 \Y) — 5(e) 
= I(Xp, X 2 ) + I(X 3 , Y) 

+ I(Xi, X 2 ; X 3 , Y) — 5(e). 


( 55 ) 
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